Implementing Parser Metarules that Handle Speech Repairs and Other Disruptions

Authors

  • Mark G. Core
  • Lenhart K. Schubert
Abstract

Mixed-initiative dialogs often contain interruptions in phrase structure such as repairs and backchannel responses. Phrase structure as traditionally defined does not accommodate such phenomena, so it is not surprising that phrase structure parsers are ill-equipped to handle them. This paper presents metarules that specify how the instantiations of phrase structure rules may be restarted or interrupted, with allowance for interleaved speech. In the case of interleaved speech or backchannel responses, the metarules allow syntactically separate constituents to interleave or to straddle each other. In the case of repairs, the metarules operate on the reparandum (what is being repaired) and alteration (the correction) to build parallel phrase structure trees: one with the reparandum and one with the alteration. Consider the partial utterance, "take the ban- um the oranges". The repair metarule would build two VPs, one being "take the ban-" and the other being "take the oranges". The introduction of metarules simplifies the notion of an utterance, since a sentence interrupted by an acknowledgment such as "okay" can still be treated as a single utterance formed around the interrupting acknowledgment. Together, metarules and phrase structure rules specify the structures that should be accommodated by a parser for mixed-initiative dialogs. A dialog parser should also maintain a dialog chart that stores the results of syntactic and semantic analysis of all the dialog seen so far. This dialog chart will be a shared resource, eliminating the need for maintenance of a separate representation of dialog structure by a dialog manager.
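The effect of the repair metarule on the abstract's example can be illustrated with a small sketch. This is not the authors' implementation: the `Repair` class, the `parallel_readings` function, and the use of half-open token index ranges are all assumptions made for illustration, and the repair boundaries are supplied by hand rather than detected. The sketch only derives the two word sequences over which the parallel VPs would be built; it does no parsing.

```python
from dataclasses import dataclass
from typing import List, Tuple

Span = Tuple[int, int]  # half-open token index range [start, end)


@dataclass(frozen=True)
class Repair:
    """Hypothetical container for hand-annotated repair boundaries."""
    reparandum: Span      # what is being repaired
    editing_terms: Span   # fillers such as "um" between the two parts
    alteration: Span      # the correction

    def __post_init__(self) -> None:
        # The three parts are assumed to be contiguous and in this order.
        if not (self.reparandum[1] == self.editing_terms[0]
                and self.editing_terms[1] == self.alteration[0]):
            raise ValueError("repair spans must be contiguous")


def parallel_readings(tokens: List[str], repair: Repair) -> Tuple[List[str], List[str]]:
    """Return the two word sequences that parallel trees would cover.

    The first keeps the reparandum and stops where the speaker broke off;
    the second skips the reparandum and editing terms and continues with
    the alteration and whatever follows it.
    """
    r_start, r_end = repair.reparandum
    a_start, _ = repair.alteration
    prefix = tokens[:r_start]
    with_reparandum = prefix + tokens[r_start:r_end]
    with_alteration = prefix + tokens[a_start:]
    return with_reparandum, with_alteration


tokens = ["take", "the", "ban-", "um", "the", "oranges"]
repair = Repair(reparandum=(1, 3), editing_terms=(3, 4), alteration=(4, 6))
abandoned, corrected = parallel_readings(tokens, repair)
print(" ".join(abandoned))  # take the ban-
print(" ".join(corrected))  # take the oranges
```

Keeping both sequences, rather than deleting the reparandum, mirrors the abstract's point that the parser builds one structure with the reparandum and one with the alteration.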

Similar resources

Handling Speech Repairs and Other Disruptions Through Parser Metarules

Mixed-initiative dialogs often contain interruptions in phrase structure such as repairs and backchannel responses. Phrase structure as traditionally defined does not accommodate such phenomena, so it is not surprising that phrase structure parsers are ill-equipped to handle them. This paper presents metarules that specify how phrase structure rules may be restarted or interrupted (including ov...

A Syntactic Framework for Speech Repairs and Other Disruptions

This paper presents a grammatical and processing framework for handling the repairs, hesitations, and other interruptions in natural human dialog. The proposed framework has proved adequate for a collection of human-human task-oriented dialogs, both in a full manual examination of the corpus, and in tests with a parser capable of parsing some of that corpus. This parser can also correct a pre-p...

A Model of Speech Repairs and Other Disruptions

Most dialog systems ignore the problem of speech repairs and editing terms (um, uh, etc.) or use preprocessing techniques to eliminate them from the input. These systems also typically enforce a strict turn-taking protocol that does not allow speakers to interrupt each other. This paper describes a parser that can process input containing editing terms, speech repairs, and second speaker inter...

Word Buffering Models for Improved Speech Repair Parsing

This paper describes a time-series model for parsing transcribed speech containing disfluencies. This model differs from previous parsers in its explicit modeling of a buffer of recent words, which allows it to recognize repairs more easily due to the frequent overlap in words between errors and their repairs. The parser implementing this model is evaluated on the standard Switchboard transcrib...

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...

Publication date: 1998